8 research outputs found

From Simple to Complex: A Progressive Framework for Document-level Informative Argument Extraction

Author: Huang Quzhe
Zhang Yanxi
Zhao Dongyan
Publication date: 25/10/2023

Document-level Event Argument Extraction (EAE) requires the model to extract arguments of multiple events from a single document. Considering the underlying dependencies between these events, recent efforts leverage the idea of "memory", where the results of already predicted events are cached and can be retrieved to help the prediction of upcoming events. These methods extract events according to their order of appearance in the document; however, an event that appears in the first sentence is not necessarily the easiest to extract. Existing methods might introduce noise to the extraction of upcoming events if they rely on an incorrect prediction of previous events. In order to provide more reliable memory, we propose a simple-to-complex progressive framework for document-level EAE. Specifically, we first calculate the difficulty of each event and then conduct the extraction following a simple-to-complex order. In this way, the memory will store the most certain results, and the model could use these reliable sources to help the prediction of more difficult events. Experiments on WikiEvents show that our model outperforms SOTA by 1.4% in F1, indicating the proposed simple-to-complex framework is useful in the EAE task.

Comment: Accepted to the Findings of EMNLP 2023 (Long Paper)
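
The simple-to-complex ordering described in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `extract_simple_to_complex`, `difficulty` and `extract` are hypothetical, and the difficulty scores and the extractor below are toy stand-ins for whatever the paper's model actually computes.

```python
from typing import Callable, Dict, List, Sequence


def extract_simple_to_complex(
    events: Sequence[str],
    difficulty: Callable[[str], float],       # hypothetical difficulty scorer
    extract: Callable[[str, List[str]], str],  # hypothetical extractor using memory
) -> Dict[str, str]:
    """Process events from easiest to hardest, caching each result in memory."""
    memory: List[str] = []
    results: Dict[str, str] = {}
    for event in sorted(events, key=difficulty):
        # Each extraction sees only the results of easier, already handled events.
        prediction = extract(event, list(memory))
        results[event] = prediction
        memory.append(prediction)
    return results


# Toy stand-ins (assumptions for illustration only).
toy_scores = {"e1": 0.9, "e2": 0.1, "e3": 0.5}
toy_results = extract_simple_to_complex(
    ["e1", "e2", "e3"],
    difficulty=lambda e: toy_scores[e],
    extract=lambda e, mem: f"{e}|seen={len(mem)}",
)
```

In this toy run the event that appears first in the document (`e1`) is handled last, because it has the highest assumed difficulty score.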

arXiv.org e-Print Archive

Knowledge-enhanced Iterative Instruction Generation and Reasoning for Knowledge Base Question Answering

Author: Du Haowei
Huang Quzhe
Zhang Chen
Zhao Dongyan
Publication date: 07/09/2022

Multi-hop Knowledge Base Question Answering (KBQA) aims to find the answer entity in a knowledge base which is several hops from the topic entity mentioned in the question. Existing retrieval-based approaches first generate instructions from the question and then use them to guide the multi-hop reasoning on the knowledge graph. As the instructions are fixed during the whole reasoning procedure and the knowledge graph is not considered in instruction generation, the model cannot revise its mistake once it predicts an intermediate entity incorrectly. To handle this, we propose KBIGER (Knowledge Base Iterative Instruction GEnerating and Reasoning), a novel and efficient approach to generate the instructions dynamically with the help of the reasoning graph. Instead of generating all the instructions before reasoning, we take the (k-1)-th reasoning graph into consideration to build the k-th instruction. In this way, the model could check the prediction from the graph and generate new instructions to revise the incorrect prediction of intermediate entities. We do experiments on two multi-hop KBQA benchmarks and outperform the existing approaches, becoming the new state of the art. Further experiments show our method does detect the incorrect prediction of intermediate entities and has the ability to revise such errors.

Comment: Accepted by NLPCC 2022 (oral)

arXiv.org e-Print Archive

More than Classification: A Unified Framework for Event Temporal Relation Extraction

Author: Feng Yansong
Hu Yutong
Huang Quzhe
Liu Chang
Zhao Dongyan
Zhu Shengqi
Publication date: 27/05/2023

Event temporal relation extraction (ETRE) is usually formulated as a multi-label classification task, where each type of relation is simply treated as a one-hot label. This formulation ignores the meaning of relations and wipes out their intrinsic dependency. After examining the relation definitions in various ETRE tasks, we observe that all relations can be interpreted using the start and end time points of events. For example, relation Includes could be interpreted as event 1 starting no later than event 2 and ending no earlier than event 2. In this paper, we propose a unified event temporal relation extraction framework, which transforms temporal relations into logical expressions of time points and completes the ETRE by predicting the relations between certain time point pairs. Experiments on TB-Dense and MATRES show significant improvements over a strong baseline and outperform the state-of-the-art model by 0.3% on both datasets. By representing all relations in a unified framework, we can leverage the relations with sufficient data to assist the learning of other relations, thus achieving stable improvement in low-data scenarios. When the relation definitions are changed, our method can quickly adapt to the new ones by simply modifying the logic expressions that map time points to new event relations. The code is released at https://github.com/AndrewZhe/A-Unified-Framework-for-ETRE
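
As a rough illustration of reading relations as logical expressions over time points, the sketch below encodes the Includes example from this abstract as a predicate over start and end points. It is not taken from the released code: the `Event` class, the `RULES` table and `matching_relations` are hypothetical names, and the Before rule is an assumed definition added only to show a second entry.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Event:
    start: float
    end: float


Rule = Callable[[Event, Event], bool]

# Each relation is a logical expression over the events' time points.
RULES: Dict[str, Rule] = {
    # From the abstract: event 1 starts no later than event 2
    # and ends no earlier than event 2.
    "Includes": lambda a, b: a.start <= b.start and a.end >= b.end,
    # Assumed definition for illustration; not stated in the abstract.
    "Before": lambda a, b: a.end < b.start,
}


def matching_relations(a: Event, b: Event, rules: Dict[str, Rule] = RULES) -> List[str]:
    """Return the names of all relations whose expression holds for (a, b)."""
    return [name for name, rule in rules.items() if rule(a, b)]


meeting = Event(start=0, end=10)
talk = Event(start=2, end=5)
print(matching_relations(meeting, talk))  # ['Includes']
```

Under this reading, adapting to a changed relation definition amounts to editing one entry in the rule table, which mirrors the adaptability claim in the abstract.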

arXiv.org e-Print Archive

MC^2: A Multilingual Corpus of Minority Languages in China

Author: Chen Zhibin
Feng Yansong
Huang Quzhe
Lin Jiuheng
Tao Mingxu
Zhang Chen
Publication date: 14/11/2023

Large-scale corpora play a vital role in the construction of large language models (LLMs). However, existing LLMs exhibit limited abilities in understanding low-resource languages, including the minority languages in China, due to a lack of training data. To improve the accessibility of these languages, we present MC^2, a Multilingual Corpus of Minority Languages in China, which is the largest open-source corpus so far. It encompasses four underrepresented languages, i.e., Tibetan, Uyghur, Kazakh in the Kazakh Arabic script, and Mongolian in the traditional Mongolian script. Notably, two writing systems in MC^2 have long been neglected in previous corpora. As we identify serious contamination in the low-resource language splits of existing multilingual corpora, we propose a quality-centric solution for collecting MC^2, prioritizing quality and accuracy while enhancing representativeness and diversity. Through in-depth analysis, we demonstrate the new research challenges MC^2 brings, such as long-text modeling and multiplicity of writing systems. We hope MC^2 can help enhance the equity of the underrepresented languages in China and provide a reliable data foundation for further research on low-resource languages.

Comment: Work in progress

arXiv.org e-Print Archive

Lawyer LLaMA Technical Report

Author: An Zhenwei
Chen Zhibin
Feng Yansong
Huang Quzhe
Jiang Cong
Tao Mingxu
Wu Zirui
Zhang Chen
Publication date: 24/05/2023

Large Language Models (LLMs), like LLaMA, have exhibited remarkable performance across various tasks. Nevertheless, when deployed to specific domains such as law or medicine, the models still confront the challenge of a deficiency in domain-specific knowledge and an inadequate capability to leverage that knowledge to resolve domain-related problems. In this paper, we focus on the legal domain and explore how to inject domain knowledge during the continual training stage and how to design proper supervised fine-tuning tasks to help the model tackle practical issues. Moreover, to alleviate the hallucination problem during the model's generation, we add a retrieval module and extract relevant articles before the model answers any queries. Augmented with the extracted evidence, our model could generate more reliable responses. We release our data and model at https://github.com/AndrewZhe/lawyer-llama.

Comment: Work in progress

arXiv.org e-Print Archive

An update on the functional roles of long non‑coding RNAs in ischemic injury (Review)

Author: Cao Yanqun
Huang Kai
Jiang Na
Liu Jia
Lu Quzhe
Reilly James
Shang Lei
Shu Xinhua
Yang Baolin
Publication venue: 'Spandidos Publications'
Publication date: 18/05/2022

Ischemic injuries result from ischemia and hypoxia in cells. Tissues and organs receive an insufficient supply of nutrients and accumulate metabolic waste, which leads to the development of inflammation, fibrosis and a series of other issues. Ischemic injuries in the brain, heart, kidneys, lungs and other organs can cause severe adverse effects. Acute renal ischemia induces acute renal failure, heart ischemia induces myocardial infarction and cerebral ischemia induces cerebrovascular accidents, leading to loss of movement, consciousness and possibly, life-threatening disabilities. Existing evidence suggests that long non-coding RNAs (lncRNAs) are regulatory sequences involved in transcription, post-transcription, epigenetic regulation and multiple physiological processes. lncRNAs have been shown to be differentially expressed following ischemic injury, with the severity of the ischemic injury being affected by the upregulation or downregulation of certain types of lncRNA. The present review article provides an extensive summary of the functional roles of lncRNAs in ischemic injury, with a focus on the brain, heart, kidneys and lungs. The present review mainly summarizes the functional roles of lncRNA MALAT1, lncRNA MEG3, lncRNA H19, lncRNA TUG1, lncRNA NEAT1, lncRNA AK139328 and lncRNA CAREL, among which lncRNA MALAT1, in particular, plays a crucial role in ischemic injury and is currently a hot research topic

PubMed Central

ResearchOnline@GCU

Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization

Author: Chen Bin
Chen Liwei
Gai Kun
Huang Quzhe
Jin Yang
Lei Chenyi
Lei Xiaoqiang
Liao Chao
Liu An
Mu Yadong
Ou Wenwu
Song Chengru
Tan Jianchao
Xu Kun
Xu Kun
Zhang Di
Publication date: 29/09/2023

Recently, the remarkable advance of the Large Language Model (LLM) has inspired researchers to transfer its extraordinary reasoning capability to both vision and language data. However, the prevailing approaches primarily regard the visual input as a prompt and focus exclusively on optimizing the text generation process conditioned upon vision content by a frozen LLM. Such an inequitable treatment of vision and language heavily constrains the model's potential. In this paper, we break through this limitation by representing both vision and language in a unified form. Specifically, we introduce a well-designed visual tokenizer to translate the non-linguistic image into a sequence of discrete tokens, like a foreign language that the LLM can read. The resulting visual tokens encompass high-level semantics worthy of a word and also support a dynamic sequence length that varies with the image. Coupled with this tokenizer, the presented foundation model called LaVIT can handle both image and text indiscriminately under the same generative learning paradigm. This unification empowers LaVIT to serve as an impressive generalist interface to understand and generate multi-modal content simultaneously. Extensive experiments further showcase that it outperforms the existing models by a large margin on massive vision-language tasks. Our code and models will be available at https://github.com/jy0205/LaVIT

arXiv.org e-Print Archive